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CONSISTENCY PROPERTIES OF A SIMULATION-BASED 
ESTIMATOR FOR DYNAMIC PROCESSES 1 

By Manuel S. Santos 
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, University of Miami and Universidad Carlos III de Madrid 

This paper considers a simulation-based estimator for a general 
class of Markovian processes and explores some strong consistency 
. properties of the estimator. The estimation problem is defined over 

a continuum of invariant distributions indexed by a vector of pa- 
rameters. A key step in the method of proof is to show the uniform 
■ convergence (a.s.) of a family of sample distributions over the domain 



Oh 



of parameters. This uniform convergence holds under mild continuity 
and monotonicity conditions on the dynamic process. The estima- 
• tor is applied to an asset pricing model with technology adoption. A 

| challenge for this model is to generate the observed high volatility 

of stock markets along with the much lower volatility of other real 
economic aggregates. 



■ 1. Introduction. Simulation-based estimation is advocated in several ap- 
C*~> [ plied areas of economics and finance (e.g., [6, 12, 15]), but relatively little is 

known about asymptotic properties of these estimation methods. Our pur- 
. pose here is to establish some strong consistency properties of a simulation- 

based estimator. Although the estimator seems highly specific, our results 

■ should be of broad application to other types of simulation-based estimation 
. methods. The estimator is applied to a macroeconomic model of technology 

adoption where some parameters are hard to estimate by other methods. 2 In 
this model, news about the arrival of new technologies will suddenly impact 
the stock market because of their option value in the creation of new prod- 
5— i ' ucts and designs. A challenge for this model is to reconcile the volatility of 
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^This is actually a typical situation, since models are abstractions of reality and so 
their parameters cannot usually be estimated from raw data. 
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real economic aggregates (e.g., worked hours, consumption and investment) 
with the much greater volatility of the stock market. Traditional business 
cycle models fail to offer a joint explanation for the fluctuations of the real 
and financial sectors. Hence, it becomes of interest to search for the best 
fit of our model and check if it has the ability to replicate volatilities along 
these two dimensions. 

For simplicity, our theoretical analysis centers on the following parame- 
terized family of random dynamical systems: 

(1.1) s n = <p(s n -i,e n ,9), n = l,2,.... 

Stochastic equations of the form (1.1) often arise as solutions of various dy- 
namic models in biology, economics, physics and other applied disciplines. 
In the sequel, s n is a finite vector of state variables, e n is a vector of i.i.d. 
shocks and 9 is a finite vector of parameters. In many applications, it be- 
comes crucial the estimation of the vector of parameters 9 by a reliable 
procedure. Traditional data-based estimation methods are of limited appli- 
cability for nonlinear stochastic systems. First, the estimator may require 
optimization of the likelihood function, or of an associated objective, and 
such calculation may not be computationally feasible. Indeed, function <p 
may be unknown, and usually has to be computed numerically. Besides, 
both the vector of shocks e and some state variables s may be unobserv- 
able. 3 Second, commonly used goodness-of-fit criteria, such as the minimum 
size of the squared residuals, can be quite uninformative on the dynamic 
properties of the selected model. Simulation-based estimators can target di- 
rectly the moments of an invariant probability or some other critical aspects 
of the dynamics. 

The informational requirements for the implementation of our estimator 
are quite weak, and stem from the ability to simulate the model. Thus, a 
researcher must be able to evaluate function <p (or have access to a computer- 
generated law of motion) as specified in (1.1), and take a stand on the func- 
tional form of the probability law for the shock e. (Vector 9 may encompass 
parameter values for the distribution of e.) Therefore, the functional form 
of ip and the actual sequence of realizations {e n }n>i may both be unknown. 

The proofs of consistency of the estimator deal with the nonlinear dy- 
namical effects of the vector of parameters 9, which feeds into the evolution 
of sample paths {s n } n >i for a fixed initial condition sq and a sequence of 
shocks {e n }n>i- Indeed, the estimation problem is defined over a continuum 
of invariant probability measures [Iq which vary with parameter 9, and the 



3 These unobservable state variables are often called "latent variables." Note that func- 
tion ip could be nonseparable in s and e, and the ergodic sets and distributions of the 
stochastic dynamical system (1.1) will depend on parameter vector 9. 
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estimated parameter values are selected over a continuum of sample distri- 
butions. In contrast, traditional estimators [16, 20] select values from the 
unique invariant distribution of the data generating process. A key step be- 
low is to demonstrate the uniform convergence of a continuum of sample 
distributions for every initial condition so an d almost all sequences of real- 
izations {e n }n>i- This result is of independent interest in probability theory. 
As in usual proofs of consistency, the result amounts to a uniform law of 
large numbers, but for our simulation-based estimator the convergence is 
over a continuum of invariant distributions parameterized by vector 9. This 
uniform law of large numbers is shown to hold under mild continuity and 
monotonicity conditions. We leave for future investigation extensions of this 
analysis to nonmonotone dynamical systems. 

The paper contributes to several strands of the literature. There are var- 
ious related results on uniform convergence of invariant distributions for 
families of functions (e.g., [9, 25, 31]), but these results fall short of what 
is required in the present case since they are restricted to a fixed empir- 
ical distribution. Monotone Markov processes are studied in [2] and [10]. 
The simulation-based estimator is set forth in [11] and [21]. Available proofs 
on consistency of this estimator require uniform continuity and contractiv- 
ity conditions [29]. As discussed below, noncontinuous decision rules are of 
considerable interest in economics. Finally, many applications make use of 
simulation-based estimation (e.g., see [14] and [22] for two recent examples); 
however, in general these papers do not get into the analysis of asymptotic 
properties of the estimator. 

2. Assumptions. For convenience of the presentation, the set of states 
S is a hypercube in Euclidean space TZ k , that is, S = {s = (..., Sj, . . .) : Oj < 
Si <bi} for given constants ai < bi and all i = 1, . . . , k and > is the usual 
Euclidean (partial) order. S denotes the relativized Borel c-algebra of S. 
The vector of shocks e follows an i.i.d. process defined by a probability law 
Q on a measurable space (E,£). The domain of parameter vectors C 1Z l 
is a compact set. We say that a function h in S is increasing if h(s) > h(s') 
for all s> s' . 

Function (p: SxExQ^-S satisfies the following assumptions: 

(A.l) Measur ability. Function ip: S x E x Q — > S is measurable on the prod- 
uct space 5x^x9. 
(A. 2) Monotonicity. For each (e,9), mapping (p(-,e,9) : S — > S is increasing. 

Note that no order preserving conditions are required over space E x Q. 

(A. 3) Feller property. For every fixed 9 and every continuous function / : S — >• 
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(2.1) 
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This weak form of continuity over S is usually assumed to guarantee exis- 
tence of an invariant probability fig for mapping (p(-,-,9); see [1], Chapter 
1. 

We also need some type of continuity on the domain of parameter vec- 
tors O. Since mapping (p(-,-,9) may not be continuous on S x E, the con- 
tinuity in 9 is defined with the help of some majorizing and minorizing 
functions ip K and <p K . Let e denote the unit vector (1,1,..., 1) and k > 
0. Then ip K (s,e,9) = proj s [ip{proj s [s + Ke],e,9) + Ate] for all (s,e), where 
s' = proj s [s + Ate] means the natural projection or minimum Euclidean dis- 
tance of vector s + Ate to set S. The projection mapping just ensures that 
function (p K is well defined. Since function tp is monotone in s and the pos- 
itive vector Ate is added to both vector s in the domain and to the value 
<p(s',e,0), we get that Lp K (s,e,9) > (p(s,e,9). We write <p K (-, -,9) > <£>(•, -,9) 
to express functional dominance over > . In the same way, we define function 
(p K (s,e,0) = proj s [ip(proj s [s — Ke],e,9) — Ate]. Functions (p K and (p K will be 
further discussed below [Section 5, remark (iv)]. 

(A. 4) Continuity in the parameters. For every 9 and At > 0, the set {9'\ip K (-, ■, 
9) > tp(', -,9') > (f K (-, -,9)} contains an open neighborhood V K {9) of 9. 

Observe that (A.1)-(A.3) will all be satisfied if (p is a continuous function 
over a compact domain S x E x 0. Moreover, (A. 4) would hold trivially un- 
der the upper and lower envelope functions ^> k (sq, £, 9) = sup g > £ y K ^ ip(so,e, 9') 
and ip K (s ,e,6) = mi e , eV ^ e) ip(s ,e,9'). 

(A. 5) Uniqueness of the invariant probability. For every 9, mapping </?(•, -,9) 
has a unique invariant probability 

Certain conditions guarantee (A. 5), for example, [2] and [13]. This assump- 
tion will simplify the analysis considerably, and it is necessary to obtain 
our global convergence results. These results can suitably be extended to 
account for multiple invariant probabilities; e.g., see Lemma 4.2 below. 

Let || • || be the max norm in K l . Then for every fixed 9, we define the 
distance between mappings tp(-,-,9) and <p(-,-,9) over S x E as 



The above assumptions ensure that the invariant probability fig is always 
well defined and weakly continuous in 9. Moreover, the maximum in (2.2) is 
always attained at some s. It should be emphasized that the standard sup 
norm in the space of functions ip(-, -,9) will be very restrictive since tp(-, -,9) 
may not be continuous in (s,e). 
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3. The simulated moments estimator (SME). 

3.1. Model simulation. Let us first place ourselves in a simple framework 
of model simulation. Assume that a researcher can evaluate function ip at 
any given point (s, e, 9), and can draw sequences {i n } n >i from a random gen- 
erator that mimics the distribution of {e n }n>i- No knowledge of the actual 
realization of the shock process {e n }n>i is required. Later, the analysis is 
extended to the more typical situation in which the researcher can only per- 
form evaluations of a numerical approximation ip 3 . Hence, for each parameter 
value 9 and initial condition so one can generate sequences {s n (so, ui, 9)} n >i 
using dynamical system (1.1); that is, s n (so,uJ,9) = (p(s n -i(so,u),0),e n ,9) 
for all n > 1. It should be emphasized that in order to search for the true 
value 9° we need to sample over the whole parameter space 0. 

3.2. Probability spaces. Let s = {s n } n >i be a sample path of observations 
of the data generating process. Let u = {e n }n>i be a sequence of realizations 
of the shock process. A measure 7 is defined over the space of sequences s = 
{s n }n>i- Also, from the probability law Q a measure 7 can be constructed 
over the space Q of sequences ui = {e n }n>i- Let A = 7 x 7 represent the 
product measure. 

3.3. The SME. Our definition of the SME is conformed by the following 
elements: 

(1) A function of interest f : S — >• W assumed to be continuous and in- 
creasing. Then the expected or mean value Eg(f) = J f(s)fig(ds) may 
represent p moments of an invariant distribution fig. The expected value 
of / over the invariant distribution of the data generating process will be 
denoted by /. The restriction to increasing functions is rather harmless. 
Indeed, following [32] the subclass of continuous and increasing func- 
tions determines convergence in the weak topology of measures; that is, 
a sequence of probability measures {fi n }n>i converges weakly to mea- 
sure fi if and only if J f(s)fi n (ds) — > n f f(s)fi(ds) for every continuous 
and increasing function / : S — > 1Z. 

(2) A distance function G:1Z P x TZ P 1Z assumed to be continuous. The 
minimum distance is attained at a vector of parameter values 

(3.1) 9° = axgmmG(E g (f),f). 

Under the above assumptions, there exists an optimal solution 9°. To 
facilitate the presentation we suppose that 9° is unique. 

(3) An estimation rule characterized by a sequence of distance functions 
{G7v}at>i and choices for the horizon {ttv}tv>i of the model's simu- 
lations. This rule yields a sequence of estimated values {9n}n>i from 
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associated optimization problems with finite samples of model's simula- 
tions and data. The estimated value 6n(sq,u,s) is obtained as 

/ ^ T N (U),S) 

6 N (s ,u},s) = arg inf G N \ — - — — V] f(s n (s ,u),6)), 

(3.2) 

1 N 

n=l 

We assume that the sequence of continuous functions {Gjy(-, -,o;,s)}jv>i 
converges uniformly to function G(-,-) for A-almost all (w,s), and the 
sequence of functions {tn(u, s)}yv>i goes to oo for A-almost all (w,s). 
Note that both functions Gn(-,-,uj, s) and tn(u, s) are allowed to depend 
on uj and s. These functions will usually depend on all information 
available up to time N. The rule tn reflects that model's simulations 
may be of a different length than data samples. 

It should be stressed that problem (3.1) is defined over population charac- 
teristics of the model and of the data generating process, whereas problem 
(3.2) is defined over statistics of finite simulations and data. 

Definition. The SME is a sequence of measurable functions {9n{sq, uj, 
s)}tv>i such that each function On satisfies (3.2) for all so and A-almost all 
(w,s). 

By the measurable selection theorem ([8], Chapter 2), there exists a se- 
quence of measurable functions {9n}n>i- 

4. Main results. As stressed above, the proof of consistency of the SME 
has to deal with a continuum of invariant probabilities. The strategy is to 
show that minimization problem (3.1) can be approximated by a sequence 
of optimization problems (3.2) for A-almost all (w,s). The hardest step in 
the proof is to demonstrate the uniform convergence (a.s.) of the sequences 

{ •nvcL.s) En=i J ' S) /(gn(so,^,6 ) ))}jv>i to E e (f) over 6, as A^oo. This uni- 
form convergence of the simulated statistics follows from a sandwich argu- 
ment that builds upon the weak continuity of the invariant probabilities of 
functions ip K and cp K under perturbations in k (Lemma 4.1), a generalized 
law of large numbers for each individual function (Lemma 4.2), and the 
order preserving property of (p as stated in (A. 2). These lemmas draw on 
some results in [30] on pointwise convergence properties of invariant prob- 
abilities for random dynamical systems satisfying the Feller property. We 
extend these results to establish uniform convergence over of the sample 
distributions (Proposition 4.3). Proofs are gathered in the final section. 
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Let fig be an invariant probability of function ip K (-,-,9), and fi K g an in- 
variant probability of function (p K (-,-,9). 

Lemma 4.1 (Continuity of the invariant probabilities). Assume that the 
sequence of positive scalars {i^j}j>i converges to zero. Then under (A.l)- 
(A.5) every sequence of invariant probabilities {/£% J }.,>i (resp. {fi K j9}j>i) 
converges weakly to the unique invariant probability fig of mapping (p(-,-,0). 

Note that our primitive assumptions (A.1)-(A.5) do not rule out the 
possibility that the auxiliary functions (p K (-,-,9) and (p K (-,-,8) may con- 
tain multiple invariant probabilities. Hence, let I(fig) be the set of all the 
invariant probabilities fig of (p K (-,-,9). Prom the proof of Lemma 4.1, the 
set T(fig) is compact and convex in the weak topology of measures. Then 
every continuous linear functional fig — > J f(s)fig(ds) attains a maximum 
and a minimum over the domain I(fig). In the same way, let I(fi K g) be 
the set of all the invariant probabilities fi K g of (p K (s,e,0). Finally, for ev- 
ery sequence of shocks u) = {e n }n>i and initial condition so, define re- 
cursively the sample paths {s^(sq, lj, 9)} n >\ and {s Kn (so,uj, 9)} n >i gener- 
ated by functions (p K and tp K as s^(sq,uj,9) = ip K (s l ^_ l (so,uj,8),e n ,9) and 
s Kn (s ,u,9) = (p K (s Kn -i(s o ,u),0),e n ,0), for all n> 1. 

We next show that the range of variation of the average behavior of a 
typical simulated path {s^(so,uj, #)} n >i or {s Kn (so,uj, 9)} n >i is bounded by 
the corresponding expected values over the sets of invariant probabilities 
T(fif) and T(fi R6 ). 

Lemma 4.2 (A generalized law of large numbers). Under (A.l)— (A.5), 
for every fixed 9 in 0, 

1 N 

max / f(s)i4(ds) >limsup — V/«(s o ,w,0)) 

1 N 

(4.1) >liminf-> f(s 

Ktt 

N^oo iV z — ' 
n=l 

> min / f(s)fi K g(ds) 

for all so and 7 -almost all co. 

In other words, there exists a set ft with 7(fi) = 1 such that (4.1) is 
satisfied for all (so, u)) G Sx 0. If each of the functions (p K (-,-,9) and •, 0) 
has a unique invariant distribution, then Lemma 4.2 is a simple consequence 
of the law of large numbers of [4] together with (A. 2). 
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The foregoing lemmas are the main ingredients in the proof of the follow- 
ing uniform law of large numbers over the parameter space G. This result 
is key to substantiate the strong consistency of the SME. 



Proposition 4.3 (Uniform convergence of the simulated statistics). Un- 
der (A.1)-(A.5), there is a set with "[(fl) = 1 such that every pair (sq,uj) € 
S x Si satisfies the following property: For each e > 0, there is a constant 
N £ (lo) such that 



(4.2) 



1 N 



71=1 



< e 



for all N > N e (u) and all 6 inQ. 



Note that N e (oj) only depends on u, and hence it holds for all sq. This 
proposition could be restated in terms of the uniform convergence in of 
the sample distributions (e.g., see [20]), as the set of continuous functions 
f :S —> W is separable. 

Theorem 4.4 (Strong consistency of the SME). Assume that the process 
s = {s n } n >o is stationary and ergodic. Then under (A.1)-(A.5), for all so 
and \-almost all (w,s), the SME {0 n (sq,<jj,s)}n>\ converges to 9°. 

The SME is computationally costly and extensive model simulations must 
be performed to sample the region of feasible parameter values. A gain in 
computational efficiency should be obtained when some parameter values 
are constrained to take on certain values or can be estimated by more prac- 
tical methods. For instance, let 6 = (61,62) and suppose that the second 
component 62 can be estimated by traditional methods. Then similar con- 
sistency results can be established for vector 9% under the presumption that 
the estimator for vector 62 is consistent. 

The consistency of the estimator can also be extended to numerical ap- 
proximations. In most dynamical models, the solution <p does not admit 
an analytical representation, but it is usually possible to perform functional 
evaluations of a numerical approximation. Most numerical methods can gen- 
erate sequences of solutions {y J }j>i that converge to the original function 
ip as we refine the approximation scheme. Hence, it is of interest to know 
asymptotic properties of the estimator as the numerical error vanishes. 

As in (3.1), for every approximate function (p 3 a solution 9 3 is attained 
over the parameterized family of invariant probabilities. More specifically, 

(4.3) ^ = argminG(y f(s)4(ds),f\ 



SIMULATION-BASED ESTIMATOR 



9 



where [£ is an invariant probability of mapping </??(•, -,#). The invariant 
probability of [£ may not be unique, even though for each 9 the origi- 
nal mapping ip(-, -,9) is assumed to have a unique invariant probability 
Also, the solution 9 3 may not be unique. The idea is that certain primitive 
assumptions (cf. [2] and [13]) may guarantee uniqueness of the invariant 
distribution fig of ip(-,-,9) but uniqueness is not generally preserved under 
numerical perturbations of the model such as piecewise linear or polynomial 
interpolations. Hence, problem (4.3) should be understood as a minimization 
over the correspondence of invariant distributions /ig. 

Theorem 4.5 (Consistency of the SME for numerical approximations). 
Let (p 3 satisfy (A.l) and (A. 3) for every j. Let <p satisfy (A.l) and (A. 3)- 
(A.5). Assume that the sequence of functions {^(-, ',6)}j>i converges uni- 
formly to (/?(-, -,9) overO in the metric (2.2). Then every sequence of optimal 
solutions {^ J }j>i of (4-3) must converge to the original solution 9° of (3.1). 

Obviously, Theorems 4.4 and 4.5 can be combined to obtain convergence 
of the estimator in both N and j. Note that Theorem 4.5 does not depend on 
monotonicity condition (A. 2) since we are assuming the uniform convergence 
of the sequence of functions {cp 3 (•, •, #)}j>i • But as in the proof of Proposition 
4.3, condition (A. 2) can be instrumental to build a sandwich argument to 
establish the uniform convergence of {<^(-, ■, 6)}j>i over 9 in from the 
pointwise convergence of these functions for each 9. 

5. Remarks. The following issues may deserve further discussion: 

(i) Latent variables. The state space S may contain both observable and 
unobservable state variables. For instance, assume that s = (s\, S2) is a vector 
in 1Z 2 such that s± represents the value of production or output, S2 represents 
the level of efficiency or productivity, and e is a productivity shock. Usually, 
both S2 and e are unobservable. But function / may encompass enough 
moments or characteristics of variable s\ so as to identify the whole vector 
of parameters 9. Several papers (e.g., see [24, 28, 33] for some early examples) 
have stressed the importance of simulation-based estimation for models with 
unobservable or so-called "latent variables." 

(ii) Monotonicity. Our results on the consistency of the SME follow from 
relatively simple assumptions that are easy to check in applications. The 
monotonicity condition (A. 2) plays a key role in our arguments, and it is 
the most restrictive assumption. 4 All other regularity assumptions are dic- 
tated by technical considerations and are less limiting in applications. As 



4 Monotonicity can be weakened under specific functional forms. Let s = (si,S2) and 
8 = (81,62). Assume that si = fi(si, s%, 8) and S2 = ^2(52, 82)- Now, if there is an unbiased 
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compared to [11], we dispense with some strong assumptions such as geo- 
metric ergodicity, a global modulus of continuity condition, and damping 
conditions on the dynamics of the system. All these assumptions may be 
hard to check in applications. 

(iii) Ergodic processes. Our results could be extended beyond Markov pro- 
cesses, but stronger continuity conditions on mapping (p may be required. 
For instance, [8], Proposition 6.21, derives a generalized law of large numbers 
that it is suitable for extensions of Lemma 4.2 to ergodic processes under 
the more restrictive assumption of continuity of mapping (/?(•, e, 9) : S — >■ S. 

(iv) Continuity in the vector of parameters. Although function ip(s,e,9) 
may not be continuous in s, our method of proof relies on some type of 
continuity of ip(s, e, 9) in 9. A natural approach would be to require the con- 
tinuity of cp(s,e,9) in 9 under the distance function d(ip(-,-,9),p(-,-,9')) in 
(2.2). This metric is suitable for our purposes, since by (A. 3) discontinuities 
of cp(s, e, 9) in s can be smoothed out when integrating over e. A related no- 
tion of continuity is assumed in (A. 4) under the majorizing and minorizing 
functions (p K and (p K . This construction has been useful to circumvent some 
measurability problems. To motivate the definition of functions ip K and ip K 
the following simple example may be helpful. Suppose that s, e and 9 are 
scalars. Let <p(s,e, 9) = s + e + 9 if s + e + 9 < 2, and ip(s,e,9) = s + e + 9 + 5 if 
s + e + 9 > 2. That is, this function is increasing, and has a jump after reach- 
ing a certain threshold. (Observe that a sufficient condition for the Feller 
property to be satisfied is that e has a continuous density.) Consider now the 
majorizing function (p K (s,e,9) = ip(s,e,6) + Ke. Then (p K > ip. However, the 
set {9'\ip K (-, •, 9) > ip(-,-,9') > ip(-,-,9)} is empty. Note that the majorizing 
function in (A. 4) is (p K (s,e,9) = ip(s',e,9) + Ke where s' = s + Ke. Here, the 
set {9' \ip K (■ , ■ , 9) > ip(-,-,9') > <£>(-,■,#)} contains an open set of parameters 
9'. 

(v) Feller property. [2] and [10] dispense with the Feller property by in- 
troducing a mild mixing condition that guarantees existence, uniqueness 
and global stability to the invariant probability. Under the mixing condition 
the random dynamical system is a contraction mapping in a suitable metric 
space of probabilities. The resulting metric topology, however, is too fine 
to allow for continuous perturbations of the parameter space. For instance, 
when applied to a probability measure \x the above restricted perturba- 
tion ip K (s,e,9) = (p(s,e,9) + Ke may not vary continuously with k over the 
distance function (2.4) of [2]. Therefore, in the absence of further specific 
conditions (e.g., [11]) the Feller property seems indispensable for the strong 
consistency of our simulation-based estimator. 



estimator for 82, consistency of the SME for 9\ can be insured by monotonicity of cpi in 
s%. In the neoclassical growth model, S2 = ^2(52, #2) corresponds to the law of motion of 
the exogenous technological progress, and monotonicity of si = (pi(si,S2,8) in si follows 
from the concavity of the utility and production functions. 
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6. Applications. Several dynamic optimization problems generate non- 
continuous, monotone decision rules. Simple discontinuities for monotone 
Markov processes are often encountered in models of economic growth and 
finance, models of firm entry, patent races, replacement of durable goods, job 
search, marriage and fertility; e.g., see [3, 17, 23], and the aforementioned 
papers in (i) of the previous section. In these models, the optimal decision 
can feature isolated jumps because of discrete choices or lack of convexity 
but the Feller property may nevertheless be satisfied. 

Our purpose here is to estimate some critical parameters of a simplified 
version of the stock market model of [18]. This model borrows several el- 
ements from [7] and [26] who are concerned with the effects of technology 
adoption on economic growth and business fluctuations rather than on fi- 
nancial markets. None of these three papers consider model estimation, and 
simply report simulations for various benchmark calibrations of the model. 



6.1. The model. The representative household has preferences over con- 
sumption c of an aggregate good and desutility of work I as represented by 
the expected discounted objective: 



(6.1) Eq 



. t=o 



Hct) k 



i + x 



with < (3 < 1 and x > 0. For given initial quantity of the aggregate asset, 
a, the optimization problem faced by this agent is to choose a stochastic se- 
quence of consumption, labor and shares of the aggregate stock {ct,k, «t}t>o 
that maximizes the objective in (6.1) subject to the sequence of budget con- 
straints 

(6.2) c t + q t a t = w t l t + (qt + d t )a t -i 

with at > 0. Observe that qt denotes the market price of one stock unit, d± 

denotes the dividend, and wt is the wage, for t = 0, 1, 

There is a mass of At intermediate goods that are bundled together into 
a composite good M t defined by a CES technology, M t = [j^ mt(j) 1 ^ dj}® 
where m(j) denotes the amount of intermediate good j G [0, A t ] bought by 
the firm and $ > 1. The firm producing the final good accumulates capital 
k and buys labor I and a bundle of intermediate goods M using production 
function y = 9 t {k?l 1 t - a ) l -^M?. Output can be devoted to consumption or 
investment, and capital depreciates at a rate 5 > 0. Total factor productivity 
Qt is governed by the following law of motion: log# t = tpe ln^_i+ef , for e\ ~ 
N(0,o~q). The firm chooses stochastic sequences of investment, labor and 
intermediate goods 77i*0')je[o ^ t ])*>o t° maximize expected discounted 

revenues. 
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Monopolistic competition prevails in the intermediate goods market. That 
is, producer of intermediate good j selects both the optimal quantity m(j) 
to sell and corresponding price p(j) — taken as given prices and quantities set 
up by all other producers of intermediate goods. Following [26], we postulate 
a very simple technology: Production of one unit of good j just requires one 
unit of the final good. Then, the profit at time t of the producer of variety 
j is given by 

(6.3) 7r t (j) = max{p t (j)ml(j) - m\(j)}. 

mt(j) 

Without loss of generality, we suppose that irt(j) is the same for all j. Each 
intermediate good may eventually be displaced from the market. Let eft be 
the probability of survival of good j. Let rt be the stochastic rate of interest 
at time t. Then for rjf = (l + n+i) • • ■ (1 + r s ) the present value Vt of operating 
each technology from the beginning of time t is defined as: 

(6.4) V t ~- 

Technological innovations arrive exogenously to the economy. The total stock 
of technological innovations Zt evolves according to the law of motion 

(6.5) Z t = <j)Zt-i+x t 
with 

(6.6) \ux t = ^p x hixt-i + e^ ^€(0,1), e% ~ N(0, a x ). 

These functional forms are considered here for simplicity. Indeed, we could 
allow for discontinuities in variables Z and x to reflect sudden changes in 
the transmission of new technologies. These technologies are put into use 
by local adopters. The adoption sector behaves as a perfectly competitive 
market. Let At be the stock of already adopted technologies, and \(H) the 
probability of adoption of a new technology after investing the amount of 
resources H. Then 

(6.7) A t+1 = X(H t )[Z t -A t ] + Mt. 

The optimal amount of expenditure Ht is derived from the following Bell- 
man equation in which the value function is the option value Jt of a new 
technology. 5 

(6.8) J t = maxl-Ht + <PE t [— i (\(H t )V t+1 + (1 - X(H t ))J t+1 ] 



As is well known, this equation can be computed recursively by the method of succes- 
sive approximations. 
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6.2. Equilibrium and asset prices. In our model, the exogenous state 
variables are the stock of technological innovations Zt and Xt and the value 
of total factor productivity 9t, and the endogenous state variables are the 
amount of capital kt and the stock of adopted technologies At . The remaining 
variables are determined endogenously as solutions to the model under (6.1)- 
(6.8) and the market clearing conditions. Let us assume that at = 1 so that qt 
corresponds to the value of the stock market. This value can be decomposed 
into the value of existing stocks plus the option value of current and future 
technological innovations. 

Proposition 6.1. The stock market value qt can be computed as 



Hence, the value of the stock market is given by the sum of four compo- 
nents: The replacement cost of installed capital, the value of adopted tech- 
nologies, the option value of technological innovations currently available 
but not yet implemented, and the present value of technological innovations 
expected to occur. Most economic models identify the stock market value qt 
with the capital stock kt+i- 

6.3. Computation, calibration and estimation of the model. The model 
can be solved numerically from the first-order conditions of the above op- 
timization problems and the market clearing conditions. We compute these 
equations using a low degree perturbation method; see [18]. Various pa- 
rameters of the model are calibrated to match some empirical statistics of 
medium-term fluctuations and the volatility of patents. 6 But for reasons 
that will become clear below, we apply our simulation-based estimator to 
three critical parameters: %, 7 and $. 

Following the economics literature (e.g., see [7] and references therein) we 
choose values for the set of parameters ((3, a, 5) to match US macro data. 7 
Parameter /3 is fixed at 0.95 so that the real rate of return of investment 
in the deterministic steady state is equal to the average real return on the 



6 Following [7] medium-term cycles are defined as those within a frequency band of 2-50 
years. The data are filtered using the band-pass filter of [5]. 

7 We consider output, hours, labor productivity and TFP for the nonfarm business 
sector. The source is the Bureau of Labor Statistics (BLS). Consumption is measured 
as the sum of nondurables and services and investment is nonresidential. Both series are 
obtained from the Bureau of Economic Analysis (BEA). Each variable is transformed in 
per capita terms using the population aged 15-64. The data sample spans from 1948 to 



(6.9) 




where X 



t — 



2004. 
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S&P index over the period 1948-2004. Parameter a is set at 0.3 to match 
the average income share of labor costs, and the annual depreciation rate 8 
is set at 0.075. 

As a proxy for the number of adopted technologies At we use the percent- 
age deviation from its exponential trend of the number of patents issued. 8 
The parameter values for the law of motion of variables 6 and Z are selected 
to approximate the variance, correlation, and first-order autocorrelation of 
total factor productivity and patents over medium-term cycles in the data. 

Following [7], we set the value of <j) to 0.97 and the steady state value of A 
to 0.1. We assume that the probability of adoption takes on the functional 
form 

(6.10) \ t {H t )=k(^-H t 

with constant A > and p € (0,1). We estimate that the average share of 
adoption expenditures over sales 9 is 0.019 for the period 1970-1998. Then, 
in the steady state solution, p must be 0.11. 

We are then left with the estimation of parameters x, 7 and There 
are several estimates of the elasticity of individual labor supply Xi but these 
estimates do not seem adequate for our aggregate model since hours worked 
fluctuate much less than in the data. Regarding the share of intermediate 
goods 7, there are empirical estimates for the industrial sector (e.g., [19]), 
but our measure of intermediate goods is much broader. Along the same 
lines, there are estimates of the mark-up parameter t? for various sectors 
[27], but our model includes a very stylized cost function for the production 
of intermediate goods and there are various nontangible intermediate goods 
that may be expensed as patents and trademarks. 

We define the objective function of our estimator as 

~ {&inv &inv) H ~ (&hours 0~hours) 



(6.11) 

+ ~ (o stock ' -Vstock) , 

v stock 

where the standard deviations of the model a are compared against stan- 
dard deviations of the data a for variables inv = investment, hours = hours 
worked, and stock = stock market value. These deviations are weighted by 



The data come from the total number of utility patents granted from the US Patent 
and Trademark Office for 1970-2004, and from Historical Statistics of the United States 
series W-99 for 1948-1970. 

9 This measure is estimated as the average ratio of nonfederally funded development ex- 
penditures (a subset of R&D expenditures) over net sales for R&D-performing companies 
from the National Science Foundation. 



SIMULATION-BASED ESTIMATOR 



15 



the inverse of the standard deviation of a of each corresponding variable. 
Note that each a is a function of parameter values. The objective is mini- 
mized over the Euclidean product of interval values: % G [0, 1], 7 G [0.1,0.60] 
and 1? G [1.1, 2.2] under a inv = 8.86, o hours = 3.31, a stock = 31.41, and a &inv = 
0.0091, =0.0035, and & & . . =0.0315. 

7 ^ hours ' u stock 

The minimum is achieved at the following parameters: x = 0, 7 = 0.322 
and # = 1.4, with values for the objective: Oi nv = 10.44, odours = 1-46 and 
a s tock = 5.34. Therefore, one main conclusion from this exercise is that in this 
model the arrival of new technologies can at most account for approximately 
one sixth of the variability in the stock market (i.e., o stock = 5.34 vs. o stock = 
31.41), whereas we have checked that standard versions of the neoclassical 
model can only account for approximately one tenth of this variability. Also, 
the model cannot account for over half of the volatility of hours [a hours = 1-46 
vs. (J hours = 3.31), which of course may suggest that some labor market 
frictions must play an important role. 

To have a better view of the variability of stock markets, in future research 
we are planning to consider some other variables such as debt (leverage), 
liquidity constraints, taxes and subsidies, other production functions and 
markup policies, and monetary and financial shocks. Of course, the volatility 
of stock markets at present times may be due to lack of confidence and 
collateral requirements, but these latter variables may have played a much 
smaller role in our period of analysis. 

7. Proofs. 

Proof of Lemma 4.1. Let <p K ^(s,e, 9) = (p(s',e, 9) for s' = proj s [s + 

Kje] for every Kj > for j = 1,2, Let \& k j(s,0) = J Cp K i> (s,e,9)Q(de) and 

*(s,0) = / <p(s,e,6)Q(de). Then 



By (A. 3), mapping ^(-, 9) is continuous in s. As S is a compact set, 9) is 
uniformly continuous. Hence, the sequence of functions '(•, 9)}j>i must 
converge uniformly to ^(-,9) over S as Kj goes to zero. Further, by (A. 2), 
we have that (p Kj (s,e,9) > f(s,e,9) for all s and kj > 0. Then from (7.1) 
and the aforementioned convergence of the sequence '(•, 9)}j>x the se- 
quence {if Kj '(•, •, 9)}j>± must converge to ip(-,-,9) in the metric (2.2) as Kj 
goes to zero. Since ip K i (s, e, 9) = (p K i (s, e, 9) + Kje, the sequence of functions 
{(p Kj (-, -,9)}j>i must converge to (p(-, -,9) in the metric (2.2). Therefore, the 
corresponding sequence of invariant probabilities {l^g 3 }j>i must converge 
weakly to fig; see [30], Theorem 2. By the same argument, we can establish 
the weak convergence of the sequence of invariant probabilities {fJ, Kj 9}j>l- 
The proof is complete. □ 



(7.1) 
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Proof of Lemma 4.2. Prom the proof of the preceding lemma, it be- 
comes clear that both functions ip K and ip K satisfy (A. 3). Then the first and 
third inequalities in (4.2) follow from [30], Theorem 3. The second inequality 
follows from (A. 2) and the fact that ip K > ip K . □ 

Proof of Proposition 4.3. Since a countable union of sets of measure 
zero has measure zero, it suffices to establish that for a fixed rational number 
e > there is a set Cl with 7(fi) = 1 such that for every sq and wdiiwe can 
find N e (u) so that 

1 - 

n=l 



(7.2) 



< e 



for all N > N e (co) and all 9 in 9. 

As is compact, by Lemma 4.1 we can cover this set by a finite number 
of open neighborhoods V Kj (9j) for points {9j} such that for all j = 1, ... , J, 

(7.3) max f (s) (igUds) - min f(s)n K e (ds)<^-. 

By (A. 2) and the definition of the functions (p Kj (s,e,0j) and ip K .(s,£,6j), for 
all 9 € V Kj (9j) and all N > 1 the following inequalities must hold true 



1 N 1 N 

(«O,W,0j)) > —J2f(s n (s ,U,9)) 



(7.4) 



N ^— ' ^»->-j» - N 

n=l n=l 



1 N 

- AT XT /( s « 3 «( s 0> w i %)) 



AT 

n=l 



and 

(7.5) / f(s)fi K e Hds) > I f(s)fi 6 (ds) > I f(s)v Ki eAds) 



for all invariant distributions fig 1 and h^q^ . Moreover, by Lemma 4.2 there 
is a set Clj with 7(f2j) = 1 such that for each (sq,u) € S 1 x tij and f there is 
Ni (w) such that 

. j iV 

(7.6) K max K / f(s)$ (ds) + \ > -J] /(# ( So , w, %)) 



and 

iV 



7V n=l ^e^X^.e.) J 



c 
2 
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for all N > Ni(u). Let Cl = f)j =1 flj and N e (w) = max{Ni (u)}j =1 . Then, 

combining inequalities (7.3)-(7.7) we get that for every so and every oj £ Cl 
condition (7.2) must hold true for all N > jV e (w). □ 

Proof of Theorem 4.4. By assumption, the process s = {s n }n>o is 
stationary and ergodic. Hence, {-k ^2^=1 /00}-W>i converges (a.s.) to /. 
Then Theorem 4.4 is an immediate consequence of Proposition 4.3 and the 
following assumptions: (i) The space of parameters is a compact set, (ii) 
the maximizer 9° in (3.1) is unique and (iii) the sequence of continuous 
functions {Gn(-, •, so, w)}at>i converges uniformly to continuous function 
G(-, •) for all so and almost all u. □ 

Proof of Theorem 4.5. Observe that by assumption the sequence 
{ip 3 '(■, •, 0)}j>i converges uniformly to ip(-,-,9) in the metric d(-, •) of (2.2). 
Hence, by (A. 4) and a simple application of the triangle inequality, we get 
that given 5 > for every 9 there are a neighborhood V(0) and a constant 
J such that d(<p(; -,9),<p>(; -,9')) < 5 all 9' € V(0) and all j > J. Moreover, 
for every function / and e > this neighborhood V{9) and 5 > can be 
chosen small enough so that | f f(s)/j,o(ds) — j f(s)fi 3 e ,(ds) \ < e for all j > J 
and all 9' G V(9), see [30], Corollary 1. It is now easy to see that the se- 
quence {G(J f(s)ji 3 e (ds)J)}j> 1 converges uniformly to G(J f(s)(i$(ds),f). 
Therefore, the corresponding sequence of minimizers {9 J }j>\ must converge 
to 9°. □ 

Proof of Proposition 6.1. This equation for the value of the stock 
market is obtained from standard arguments using the no-arbitrage condi- 
tions along an equilibrium path (e.g., see [18]). □ 
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